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Abstract 

It is proved that among the rational iterations locally converging with order 
s > 1 to the sign function, the Pade iterations and their reciprocals are the 
unique with the lowest sum of the degrees of numerator and denominator. 
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1. Introduction 

The function s{z) = sign(z) is defined for a nonimaginary complex num- 
ber z as the nearest square root of unity. Let A be a matrix having no purely 
imaginary eigenvalues. Since s{z) is analytic at the eigenvalues of A, matrix 
function theory |5|] allows one to define sign(y4). The matrix sign function is 
less trivial than its scalar counterpart, for instance it is not locally constant, 
and it has important applications, either direct like the solution of algebraic 



Riccati equations [12[ and the treatment of certain quantum chromodynam- 
ics models 19| or indirect as a basis to compute other important matrix 
functions like the matrix square root, the polar decomposition of a matrix 
and the geometric mean of two positive definite matrices jsl, Isj . 

A common way to compute the matrix sign function is through rational 
iterations of the form Zk+i = ^p{zk), for some rational function (p{z) having 
attractive fixed points at 1 and —1, since any such iteration converges locally 
to the function sign(z). The prototypical example is Newton's method for 
z"^ — 1 = 0, but many other iterations have been proposed. Among them, a 
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very popular family is obtained by using the Pade approximants to /(^) 
(1 — 0"^^^ and the following characterization 



sign(zj 



[^2)1/2 (1-0^/2 



where ^ = Let the (m, n) Pade approximant to /(^) be Pm,n(yO / Qm,n{0 y 

and m + n ^ 1. The iteration 

has been proved to be locally convergent to 1 and —1 with order of conver- 



gence m + n+ lfoim^n — 1 [10[. The notation (f2m+i,2n introduced here 
highlights the fact that the numerator and denominator of ip2m+i,2n have 
degree 2m + 1 and 2n, respectively. 

We recall that, for integers m,n ^ 0, the {m,n) Pade approximant to 
a function h{z) is a rational function p{z)/q{z), where p{z) and q{z) are 
polynomials of degree m and n, respectively, such that 

h(z)-^ = 0{z^^^^'). 

(1[Z) 

For an introduction to the Pade approximation see the book [ij. 

The iterations ([1]) have been derived by Kenney and Laub [13] and are 
called Pade family of iterations or just Pade iterations; they have been con- 
sidered also in js], 0, [si 14, 20] for computing matrix functions or invariant 



subspaces of a matrix. Observe that the definition of Pade iterations in [10 
is slightly different from ours, since we exclude the case m = n = 0, which 
yields the trivial iteration Zk+i = '^i,o{zk) = Zk being not locally convergent 
to 1 and —1. 

Using the identity 

sign{z) = = , 

z z 

and the Pade approximants to = (l—^Y^'^, a different family of iterations 
having attractive fixed points at 1 and —1 is obtained. If p{z)/q{z) is the 
{m,n) Pade approximant to the function h{z) and h{0) ^ 0, then q{z)/p{z) 
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is the {n,m) approximant to l/h(z) (see [l], Theorem 1.5.1]), thus the {m,n) 
Pade approximant to g{^) is Qn,m{0 / ^n,m{0 ■ The iteration 

_ Qn,m(l ~ ^fc) / X /„\ 

— 75 _ 2\ ~- V2m,2n+l[Zk) {^) 

is obtained. We call the iterations (|2]) reciprocal Pade iterations or recipro- 
cal Pade family. The possibility to invert the functions defining the Pade 
iterations is suggested as well by Laub in [l5j without further discussions. 

Many iterations of interest can be retrieved in the Pade family and its 
reciprocal: ip2,i, f3,o and ip3^2 give Newton's method, the Newton-Schulz 
iteration and Halley's method for — 1 = 0, respectively. Among the Pade 
family ([1]), the most common iterations are those with constant denominator 
(i.e., n = in ([T])), and the so-called principal Pade iterations, namely, those 
for which the degrees of the numerator and denominator differ by 1 (i.e., 
m = noim = n — 1 in ([1])). Similarly, the reciprocal Pade family contains 
iterations with constant numerator (m = in (|2])) or for which the degrees 
of the numerator and denominator differ by 1 (m = n or m = + 1 in ([2]) ) . 

Prior to Kenney and Laub, the Pade iterations for m = n — 1 and the 
reciprocal Pade iterations for m = n have been derived in a different way 
by Rowland [6|; lannazzo [7| proved that the same iterations obtained by 
Rowland can be retrieved in the family of root-finding algorithms sometimes 
called Konig family [2(|or basic family |9| and attributed |2| to a paper written 
by Schroder in 1870 [l6|, [l7 . 

The Pade family and its reciprocal family are just two of the infinite fam- 
ilies of rational iterations having the square roots of unity as attractive fixed 
points. A rationale for their use can be given by their interesting properties 
(see js], [lil for the case of the Pade family; some analogous properties hold for 
the reciprocal Pade family). We show that they also have an optimality prop- 
erty: among all rational iterations having order of local convergence s > 1 at 1 
and —1, they are the unique iterations such that the sum of the degrees of the 
numerator and denominator is minimal. This is a highly desirable property in 
terms of computational efficiency: in the generic case it is cheaper (in terms 
of the number of arithmetic operations required) to evaluate a{z)/b{z) than 
a{z)/b{z) when deg(a(2;)) + deg(6(z)) > deg(a(z)) + deg(6(z)) and Rorner's 
scheme is applied. 

Let s, m, n be nonnegative integers such that s > 1 and m + n = 2s — 1. 
Observe that, letting m,n vary, the family z/^^i = ^mn{zk) is the union of 
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the Pade family and its reciprocal family; the parities of m and n distinguish 
one from the other. The following property of iprnn{z) is the main result of 
the paper and will be proved in the next section. 

Theorem 1. Let s > 1 be a nonnegative integer. The functions (pmn{z), for 
m = 0,1, ... ,2s — 1 and n = 2s — m — 1 define the unique rational iterations 
of the kind Zk+i = (p{zk) such that 

01 the iteration converges locally to 1 and —1 with order at least s; 

02 for every iteration Wk+i = ^{wk) = a{wk)/b{wk), witha{z),b{z) polyno- 
mials, having order at least s in both 1 and —1, it holds that deg(a(2;)) + 
deg{b{z)) ^ deg(a(z)) + deg{b{z)), where a{z) and b{z) are coprime 
polynomials such that (f{z) = a{z)/b{z). 

Moreover, the iterations have order exactly s in both 1 and —1. 

We consider just the case s > 1 for two reasons: first, in matrix functions 
computation, algorithms based on rational iterations are competitive if they 
converge fast, that is if they are of order at least 2; second, if s = 1 a direct 
computation shows that the unique iterations satisfying Ol and 02 are the 
same as the ones obtained for s = 2. 

It is worth noting that a rational iteration satisfying Ol and 02 for some 
s > 1 is not necessarily the one whose iteration function can be evaluated with 
the minimal cost. In principle, there can be a special rational function which 
does not satisfy 02 and can be evaluated with fewer arithmetic operations. In 
addition, the same iteration can be evalutated with many different schemes 
yielding different computational costs, relevant in the matrix case (see js], 
Chapter 4]). For the principal Pade iterations, a partial fraction expansion 
[g], [llj and a continued fraction expansion |l3| are known and can be used 
to devise efficient evaluation schemes as in ^S, Algorithms 4.9-4.10]. 

For the sake of clarity, we recall some basic definitions regarding iterations 
of the kind Zk+i = <^{zk), where ^p{z) is a rational function and IS 8b fixed 
point of ip{z), that is, (p{z^:) = z*. We say that z^ is an attractive fixed point 
if < 1; in that case the iteration is locally convergent to z^, that 

is, any initial value Zq sufficiently close to z^ yields a sequence converging 
to z^:. We say that the iteration converges locally to z,, with order s > 1 
if there exist Mi,M2 > such that for z sufficiently close to z^, it holds 
that Mi\z — ^ \ ~ ^ M2\z — z^l'^. Since y^{z) is infinitely many 
different iable at z*, this is equivalent to requiring that 

^{z,) = z,, ip'iz,) = ip"{z,) = ■■■ = v^'-'\z,) = 0, <^(^)(z,) ^ 0. (3) 
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In particular, an iteration having order s > 1 at 2* is locally convergent to 
2;*. Further discussion on this topic can be found in any numerical analysis 
textbook, for instance 18 . 



2. Proof of the Theorem [T] 

The proof of Theorem [1] is given by some Lemmas. We first prove that im- 
posing ([3]) for a rational function ip{z) = a{z)/b{z) is equivalent to imposing 
some conditions on the polynomials a{z) and b{z) and their derivatives. Then 
we prove that they can only be satisfied if deg(a(z)) + deg(6(z)) ^ 2s — 1, 
with equality only for a unique family of polynomials. Finally, we prove 
that these unique solutions correspond to the Fade family and its reciprocal 
family. 

Lemma 2. Let s > 1 be an integer, Zk+i = ^p{zk) = a{zk)/b{zk) a rational 
iteration, and z^ be one of its fixed points (in particular, 7^ Oj. The 

iteration converges locally to with order at least s if and only if 

a^^\z,) = z,b^^\z,), for k = 0,1,..., s-1. (4) 

If moreover a^^\z^) 7^ z^b^'^\z^) then the order is exactly s. 

Proof. The rational iteration converges locally to with order at least s if 
and only if for z sufficiently close to 2;* 

y,{z)-z, = 0{iz-z,y). (5) 

Since b{z) is bounded in a neighborhood of z^,, we may multiply the left-hand 
side of by b{z) without changing its convergence behavior, thus obtaining 

a{z) ~ z,b{z) = 0{{z~ z,y), (6) 

which in turn is equivalent to (jlj). As 7^ 0, b{z) is bounded away from 
in a neighborhood of z^,, thus we may divide the left-hand side of (jH]) by 
b{z) to reverse the previous step and get (jS]). 

If a^'^\z^:) 7^ z^b^^\z^), then does not hold anymore if we replace s 
with s + 1, and neither does ([5]), i.e., the convergence order is exactly s. □ 

Using conditions (jl]), we may prove the following bound on the degrees 
of a{z) and b{z). 
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Lemma 3. Let s be a positive integer, and a{z),b{z) two polynomials, not 
both null, such that 

-''^i^) = b^'^i^)^ k-oi s-1 (7) 

aW(_l) = _5(fc)(_l), fc-U,l,...,s 1. (7j 

Then deg(a(2;)) + deg{b{z)) ^ 2s — 1, with the convention that the degree of 
the zero polynomial is —1. 

Moreover, for each pair (m, n) of integers such that m + n = 2s — 
1, min(m,n) ^ —1, there are two polynomials amn{z) , bmn{z) such that 
deg{amn{z)) = m, deg{bmn{z)) = n and the conditions ([7]) hold. The polyno- 
mials amn{z) and bmn{z) are unique up to a multiplicative factor, and 

^ ^ -h^n{-^)- (8) 

Proof. First notice that we may impose without loss of generahty that 
deg(a(z)) ^ deg(6(z)) throughout the proof. 

We shall prove the result by induction. For s = 1, the result is clear. The 
conditions that we must meet are a(l) = 6(1), a{—l) = — 6(— 1). Thus a{z) 
and b{z) cannot be both constant, and the only possibility with deg(a(z)) = 
1, deg{b{z)) = is choosing b{z) = 7, a{z) = ■yz for some constant 7 7^ 0, 
while the only possibility with deg(a(z)) = 2, deg(6(2;)) = — 1 is choosing 
b{z) = 0, a{z) = ■y{z — l){z + 1) for some constant 7 7^ 0. 

Let us suppose that the lemma holds true for a given s — 1, and prove it for 
s = s. Let us take two polynomials a{z), b{z) such that a^''\±l) = ±6^^)(±1) 
for all = 1, 2, . . . , s — 1. If b{z) = 0, then a{z) must be a multiple of both 
{z — ly and {z + 1)**, thus it has degree at least 2s and the result holds. If 
b{z) 7^ 0, we may apply the inductive hypothesis to their derivatives a^^\z) 
and b^^\z), and obtain deg(a*^"'^^(2;)) + deg(6*^^^(2;)) ^ 2s — 3. This clearly 
implies deg(a(z)) + deg(6(z)) ^ 2s — 1, since the derivative of a polynomial 
p{z) has degree deg{p{z)) — 1 (notice that this relation holds also for constant 
polynomials p{z) = c 7^ 0, with our choice deg(O) = —1). 

Let us turn now to the equality case; we shall prove the uniqueness first, 
and the existence thereafter, of the two families of polynomials attaining the 
minimal degrees. 

Let m,n be such that m + n = 2s — 1, and amn{z) and bmn{z) be two 
polynomials with deg{amn{z)) = m, deg{bmn{z)) = n satisfying ([7]). If n = 
— 1, then a{z) must be a polynomial multiple of both [z — 1)* and {z + 1)* 
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of degree 2s, and thus 

02^,-1(2;) = k{z-iy{z+l) 
b2s,-i(z) = 



(9) 



for some k ^ 0. If n ^ —1, then amn{z) and bmn{z) satisfy the equahty 
conditions of the lemma with s = s — 1; thus by the uniqueness result it must 
be the case that 

^mni^) = kam-l,n-l(yZ), ^^^^ 
^mn(^) = ^&m-l,n-l(^), 

for some k 0. From ([TOl) we get a.mn{z) = amn(±l) + k J^^am-i,n-i{t)dt 
and6„„(z) = 6„„(±1) + A; Jl^bm~i,n~i{t)dt. Imposing am„(±l) = ±6,„„(±1), 
a simple manipulation of the resulting system gives 



a^r^iz) = kA{z) + h{B{l) - B{-1) - A{1) - A{-1)), 
brnniz) = kB{z) + h{A{l) - Ai-1) - 5(1) - Bi-1)), 



with A{z) (resp. B{z)) a primitive of am-i,n-i{z) (resp. bm-i,n-i{z)) . It is 
now apparent that the two polynomials are uniquely determined up to the 
multiplicative constant k. From the inductive hypothesis a^l}^^_]^(±l) 7^ 

±^ll!n-i(±l)> it follows that a^^n{±l) ±b^l{±l). 

On the other hand, one can easily check that the polynomials defined by 
the formulas Q) and (fTT]) have degree deg(amn(-2)) = m, deg{bmn{z)) = n 
and satisfy ([7]). So said polynomials exist. □ 

Proof of TheoremUl We prove the theorem for a fixed s > 1. 

We first show that for each m,n ^ such that m + n = 2s — 1 the 
iteration Zk+i = ^mn{,Zk) is the unique rational iteration of the kind Zk+i = 
a[zk)/b{zk) such that deg(a(2)) = m and deg{b{z)) = n, satisfying 02 and 
whose order of local convergence is exactly s (thus it satisfies Ol as well). 
Then, we show that any rational iteration satisfying Ol and 02 is of the 
type Zk+i = (fmnizk) for m + n = 2s - 1. 

Let m and n be such that m + n = 2s — 1, with m,n ^ 0; then, by 
LemmaO there are two polynomials amn(^) and bmn{z) such that deg(amn(2;)) 
m, deg{bmn{z)) = n satisfying ([7]) and ([8]). Since s > 1, Lemma [2] implies 
that for i)rnn{z) ■■= amn{z)/bmn{z) the iteration Zk+i = ipmn{zk) converges 
locally to 1 and —1 with order exactly s, thus ipmn satisfiies Ol. 
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On the other hand, consider an iteration function ipi^) = a{z)/b{z) pro- 
viding a sequence converging with order at least s. By Lemma HI it fol- 
lows that conditions ([7]) hold and thus deg(a(2;)) + deg{b{z)) ^ 2s — 1 = 
deg(am.n(-2)) + deg{bmn{z)) by Lemma [31 Therefore the iteration Zk+i = 
il'mnizk) satisfies 02. By the same lemma, equality holds if and only if a{z) 
and b{z) differ from amn{z) and bmn{z) by the same multiphcative factor, i.e., 
when ip^z) and 'ipmni.z) coincide. Thus, this is the unique iteration satisfying 
both Ol and 02 of the kind Zk+i = a{zk)/b{zk) with deg(a(z)) = m and 
deg(6(2;)) = n. 

Now we show that ipmn{z) coincides with iprnn{z)- Since the numerator of 
^mn{.z) has degree m and its denominator has degree n, it is enough to prove 
that Zk+i = fmn{zk) satisfies Ol (thus 02, in view of Lemma [3]). 

a) Odd m, m = 2mi + 1, n = 2ni. Let hf,e{() := P^e{C) / Q ^e{C) be the 
{fi,i) Fade approximant to (1 — C)"^''^- Then, in a neighborhood of C = 0, 

V(C)-(i-C)-'/' = 0(C'^+^+i). 

Since ipmn{z) = zhm^^mi'^ - z^) and mi + ni = s - 1, we get '^mni.z) - 
zi^z"^)^^/"^ = 0((1 — z'^Y), for z sufficiently close to 1 or —1. Since (1 — z'^Y = 
0{{z — 1)'') and (1 — z'^Y = C){{z + lY) in a neighborhood of 1 and — 1 
respectively, it holds that 

'^mn{z) — 1 = 0{{z — lY): for -2 in a neighborhood of 1, ^^^^ 
fmn{z) + 1 = 0{{z + 1)''), for z in a neighborhood of — 1, 

and then z^+i = (prnn{z) verifies Ol. 

b) Even m, m = 2mi, n = 2ni + 1. By a reasoning similar to case a, 
we use the (/x, Fade approximant to (1 — CY^'^^ say h^i(Q, and Lpmn{z) = 
hmi.nii^ — z'^)/z, thus ipmn{z) Verifies (IT^ as well and Zk+i = ipmniz) verifies 
Ol. 

Thus, for each m, n such that m + n = 2s — 1, we have (pmn{z) = ipmn{z) 
and the iteration Zk+i = ^mn{zk) is the unique iteration satisfying Ol and 
02 and whose numerator has degree m and denominator has degree n. 

Let Wk+i = a{wk)/b{wk) be a rational iteration satisfying Ol and 02. 
Let m' = deg(a(2;)) and n' = deg(6(z)). By Lemmas [2] and [3] one has m'+n' ^ 
2s — 1. We claim that m' + n' = 2s — 1; if on the contrary m' + > 2s — 1 
then there exist m ^ m' and n ^ n' such that m + n = 2s — 1 and Zk+i = 
^mn{zk) satisfies Ol, thus Wk+i = a{wk) /b{wk) cannot satisfy 02 and we 
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get a contradiction. Finally, by the aforementioned uniqueness result we 
conclude that a{z)/b{z) must coincide with (pm',n'iz) up to a multiplicative 
factor. □ 
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